National Repository of Grey Literature
Gender recognition from the text data
Mačát, Jakub ; Burda, Karel (referee) ; Červenec, Radek (advisor)
This bachelor's thesis focuses on gender identification from text, specifically from the form of an e-mail, and on contemporary data mining and text mining techniques. It discusses the advantages, disadvantages, and possible uses of these techniques. A gender recognition program was implemented in Java, and the processing of various learning methods is demonstrated in RapidMiner. For both programs, the thesis describes their basic attributes and the methods and operators used in the implementation. The programs were tested on real data. The thesis then outlines ways of extending the programs and finally gives examples of how the programs handle the stated task.
Effect of HFS Based Feature Selection on Cluster Analysis
Malásek, Jan ; Klusáček, Jan (referee) ; Honzík, Petr (advisor)
This master's thesis focuses on cluster analysis. Clustering has its roots in many areas, including data mining, statistics, biology and machine learning. The aim of the thesis is to survey cluster analysis methods and methods for determining the number of clusters, and to give a short overview of feature selection methods for unsupervised learning. A major part of the thesis is a software implementation for comparing different cluster analysis methods, focused on finding the optimal number of clusters and assigning data points to the correct classes. The program also includes an implementation of the HFS feature selection method. The methods were validated experimentally in the Matlab environment. The thesis concludes by comparing the success of the clustering methods on data with known output classes and by assessing the contribution of the HFS feature selection method for unsupervised learning to the quality of cluster analysis.
